Recovering a Hidden Community Beyond the Spectral Limit in O(|E|log*|V|) Time
نویسندگان
چکیده
The stochastic block model for one community with parameters n,K, p, and q is considered: K out of n vertices are in the community; two vertices are connected by an edge with probability p if they are both in the community and with probability q otherwise, where p > q > 0 and p/q is assumed to be bounded. An estimator based on observation of the graph G = (V,E) is said to achieve weak recovery if the mean number of misclassified vertices is o(K) as n → ∞. A critical role is played by the effective signal-to-noise ratio λ = K2(p− q)2/((n−K)q). In the regime K = Θ(n), a näıve degree-thresholding algorithm achieves weak recovery in O(|E|) time if λ → ∞, which coincides with the information theoretic possibility of weak recovery. The main focus of the paper is on weak recovery in the sublinear regime K = o(n) and np = n. It is shown that weak recovery is provided by a belief propagation algorithm running for log∗(n)+O(1) iterations, if λ > 1/e, with the total time complexityO(|E| log∗ n). Conversely, no local algorithm with radius t of interaction satisfying t = o( logn log(2+np) ) can asymptotically outperform trivial random guessing if λ ≤ 1/e. By analyzing a linear message-passing algorithm that corresponds to applying power iteration to the non-backtracking matrix of the graph, we provide evidence to suggest that spectral methods fail to provide weak recovery if λ ≤ 1.
منابع مشابه
Recovering a Hidden Community in a Preferential Attachment Graph
Abstract—A message passing algorithm is derived for recovering a dense subgraph within a graph generated by a variation of the Barabási-Albert preferential attachment model. The estimator is assumed to know the arrival times, or order of attachment, of the vertices. The derivation of the algorithm is based on belief propagation under an independence assumption. Two precursors to the message pas...
متن کاملSpectral Estimation of Stationary Time Series: Recent Developments
Spectral analysis considers the problem of determining (the art of recovering) the spectral content (i.e., the distribution of power over frequency) of a stationary time series from a finite set of measurements, by means of either nonparametric or parametric techniques. This paper introduces the spectral analysis problem, motivates the definition of power spectral density functions, and reviews...
متن کاملar X iv : 0 80 3 . 09 29 v 4 [ cs . D S ] 1 8 N ov 2 00 9 Graph Sparsification by Effective Resistances ∗
We present a nearly-linear time algorithm that produces high-quality spectral sparsifiers of weighted graphs. Given as input a weighted graph G = (V, E, w) and a parameter ! > 0, we produce a weighted subgraph H = (V, Ẽ, w̃) of G such that |Ẽ| = O(n log n/!) and for all vectors x ∈ R (1 − !) ∑ uv∈E (x(u) − x(v))2wuv ≤ ∑ uv∈Ẽ (x(u) − x(v))2w̃uv ≤ (1 + !) ∑ uv∈E (x(u) − x(v))2wuv. (1) This improves...
متن کاملSemidefinite Programs for Exact Recovery of a Hidden Community
We study a semidefinite programming (SDP) relaxation of the maximum likelihood estimation for exactly recovering a hidden community of cardinality K from an n× n symmetric data matrix A, where for distinct indices i, j, Aij ∼ P if i, j are both in the community and Aij ∼ Q otherwise, for two known probability distributions P and Q. We identify a sufficient condition and a necessary condition fo...
متن کاملSubmatrix localization via message passing
The principal submatrix localization problem deals with recovering a K ×K principal submatrix of elevated mean μ in a large n × n symmetric matrix subject to additive standard Gaussian noise. This problem serves as a prototypical example for community detection, in which the community corresponds to the support of the submatrix. The main result of this paper is that in the regime Ω( √ n) ≤ K ≤ ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1510.02786 شماره
صفحات -
تاریخ انتشار 2015